首页> 外文OA文献 >Kannada Spell Checker with Sandhi Splitter
【2h】

Kannada Spell Checker with Sandhi Splitter

机译:Kannada拼写检查与sandhi splitter

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

Spelling errors are introduced in text either during typing, or when the userdoes not know the correct phoneme or grapheme. If a language contains complexwords like sandhi where two or more morphemes join based on some rules, spellchecking becomes very tedious. In such situations, having a spell checker withsandhi splitter which alerts the user by flagging the errors and providingsuggestions is very useful. A novel algorithm of sandhi splitting is proposedin this paper. The sandhi splitter can split about 7000 most common sandhiwords in Kannada language used as test samples. The sandhi splitter wasintegrated with a Kannada spell checker and a mechanism for generatingsuggestions was added. A comprehensive, platform independent, standalone spellchecker with sandhi splitter application software was thus developed and testedextensively for its efficiency and correctness. A comparative analysis of thisspell checker with sandhi splitter was made and results concluded that theKannada spell checker with sandhi splitter has an improved performance. It istwice as fast, 200 times more space efficient, and it is 90% accurate in caseof complex nouns and 50% accurate for complex verbs. Such a spell checker withsandhi splitter will be of foremost significance in machine translationsystems, voice processing, etc. This is the first sandhi splitter in Kannadaand the advantage of the novel algorithm is that, it can be extended to allIndian languages.
机译:拼写错误是在键入过程中或用户不知道正确的音素或字素时引入的。如果一种语言包含诸如sandhi之类的复杂词,其中两个或多个词素根据某些规则结合在一起,则拼写检查将变得非常乏味。在这种情况下,使用带有三叉戟拆分器的拼写检查器通过标记错误并提供建议来警告用户是非常有用的。提出了一种新的桑迪分裂算法。 Sandhi拆分器可以拆分大约7000个以Kannada语言用作测试样本的最常用sandhiword。 Sandhi拆分器与Kannada拼写检查器集成在一起,并添加了生成建议的机制。因此,开发了一个全面的,独立于平台的独立拼写检查器以及Sandhi Splitter应用程序软件,并对其效率和正确性进行了广泛的测试。对带有三菱分离器的该拼写检查器进行了比较分析,结果得出结论,带有三菱分离器的Kannada拼写检查器具有改进的性能。它的速度快两倍,空间效率提高200倍,在复数名词的情况下,准确率达到90%,在复数动词的情况下达到50%。这种带有三菱分割器的拼写检查器在机器翻译系统,语音处理等方面将具有最重要的意义。这是坎纳达语中的第一个三菱分割器,该新颖算法的优点在于,它可以扩展到所有印度语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号